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This study, sponsored by the Cooperative VJashinston Education Center, 
at4:eui}ts to clarify the hlc;hly coiaplex probleia of the assessment and 
evaluation of teachers. Although it is compri^^^d laostly of non-eiapirical 
data and lacks statistical sophistication, it can bo a valuable IndeK of 
conanon practices in the daily operation of school districts* 

The study further examines how the intent of yaehington law KCil 28A. 
72*030 is being carried out by school districts with regard to teacher 
assessment and evaluation. 

Finally, the study cites some of the implications of teacher assessment 
and evaluation » and offers seme practical suggestions in developing your 
own evaluation programs* 
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THE WAY WE SEE IT: A Survey of Teacher Evaluation 
Policies and Practices Operant in the State of W/^shington 



CHAPTER I 
INTRODUCTION 



The past decade has seen education enveloped in cnyriad crises causing difisipation 
of public confidence in the schools. Disparate views concerning the role of public 
education ranged from those who were accustomed to viewing the educational institution 
as a societal godsend with the antithetical view increasingly being expressed by doubting 
the schools' ability to meet pending social needs. The current financial crisis con* 
fronting public school educators has become a prominent national concern and has in part 
led to a mandate to devise assessment techniques which yield the type of data which 
Jn^aningfully enhances public knowledge concerning the effectiveness of current programs 
and practices* Thusly^ this concern has forced educators to consider the assessment 
problem and subsequent evaluative procedures which have never before been widely used 
by the education community. 

Apart from the obvious and immediate application to instructional program effective^ 
ness the mandate to become increasingly accountable has extended to teacher effective- 
ness. Nationally the aforementioned trend seems incontrovertable while in our own 
State of Washington recent state law has sought to clarify the responsibility of asseS** 
sing teacher effectiveness: 

Every board of directors ^ in accordance with procedure provided in 
RCW 28A. 72. 030 , shall establish an evaluative criteria and procedures 
for all certificated employees. Such procedure shall require not less 
than annual evaluation of all employees. New employees shall be evalua* 
ted within the first ninety calendar days of their employment. Every 
employee whose work is judged unsatisfactory shall be notified in writing 
of stated areas of deficiencies along with recommendations for improve- 
ment by February 1st of each year. A probationary period shall be estab* 
lished from February 1st to April 15th for the employee to demonstrate 
Improvement . 

While public insistence has intensified the need for accountability procedures^ 
efforts at formulating evaluative tools had already begun as a direct result of change 
in education. The proliferation of teaching strategies, the increased monetary appro- 
priations through federally administered Title programs, and the wealth of educational 
materials decidedly makes education in the State of Washington a non static entity » 
In this transitional environment ^ decisions must be evaluated^ organizational procedures 
must be appraised ^ and materials must be tested. Change, if It denotes progress » must 
be accompanied by evaluation, out of which should emerge the basis for all subsequent 
change. 

I. THE PROBLEM 

Statement of the Problem 

The present study attemqpts by means of questionnaire to survey teacher evaluation 
policies and practices operant in First Class Districts in Washington State, and to 
report various concerns arising from these evaluation programs as currently conductedt 
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Purpose of the Study 

In October, 1971, district coordinators for the Cooperative Washington Education 
Centers assembled at Central Washington State College to discuss common educational 
concerns including that of teacher accountability. As a result of this meeting, the 
Cooperative Center commissioned this survey in the hopes of assisting school districts 
in their development of more effective teacher evaluation programs. The purpose of 
the survey is to enable school districts in Washington State to learn of diverse eva- 
luative methods currently employed, the primary intent being to provide a background 
on which districts might revise their evaluation programs as they deem advisable. It 
is anticipated further, that ideas gleaned from the survey will ultimately effect 
instructional improvement by encouraging school districts to devise evaluation pro- 
grams conducive to professional grov;th on the part of the teacher* 

II. DEFINITION OF TERMS 

For the purpose of this study the following terms have been defined. 

First Class School District 

A First Class School District is a school district having in excess of 10,000 
population as shown by regular census or any other evidence proved acceptable by 
the local Intermediate School District. 

Cooperative .Washin5^ton Education Center 

The Cooperative Washington Education Center is an organization comprised of 
twenty-four institutional participants who benefit from a spirit of mutual coopera- 
tion and concern. The Cooperative Center in its attempts to promote needed change 
and to reduce the duplication of efforts in school districts, has developed means of 
providing for an exchange of innovative ideas. The Educational Practices Inventory, 
a compilation of innovative programs operant in the member centers, and the annual 
in-service workshop allow for a sharing and dissemination of applicable practices. 
Through a shared services program, districts may request, at a no-cost basis, other 
district personnel possessing expertise in curricular content areas and/or in educa- 
tional administration. The Principal's Leadership Tr-Tiining Project, initiated in 
1970, assists participating principals in developing administrative competencies 
essential to their positions. Representing indireccly one-quarter of the State's 
pupil enrollment, the Cooperative Washington Education Center has from the onset 
been concerned with the dissemination of innovative programs and practices through 
hosting an annual statewide education conference and publishing a nxunber of mono- 
graphs, pamphlets, and newsletters. 

Evaluation Programs 

Numerous districts reported their evaluation programs of teachers under current 
review and/or revision. The data in this survey reflects policy initiated prior to 
1972 with one exception: if a district indicated an experimental or pilot program 
had been implemented and its formal adoption by the district was probable, then this 
new program was included in the study rather than the district's previously stated 
policy- 

Teacher Accountability 

Teacher accountability refers to the assessment of teacher performance as a causal 
factor in the pupils' achievement of desired objectives in light of resources provided. 



III. LIMITATIONS OF THF STUDY 



This study v/as dependent upon the information and data submitted by the First 
Class Districts surveyed. Conclusions are drax;n from the approximate sixty per 
cent of all First Class Districts responding to the survey. This study describes 
these districts' policies governing teacher evaluation. It does not state how 
assiduously these policies are enforced or carried out, nor does it report on the 
relative success of tlie various evaluation programs • 



CMPTER II 
REVIEW OF THE LITERATURE 



Few topics have been as thoroughly researched as the subject of evaluating teacher 
effectiveness. As Biddle noted in his preface to Contemporary Research on Teacher 
Effectiveness : 

Probably no aspect of education has been discussed with greater frequency, 
with as much deep concern, or by more educators and citizens than has that 
of teacher ef f ectiveness--how to define it, hov; to identify it, hov; to 
measure it, how to evaluate it, and how to detect and remove obstacles to 
its achievement (5:4), 

Yet the research on teacher effectiveness has produced deplorably few results- 
The persistent problem of determining v^hat teaching leads to what learning continues 
to plague administrators and teachers alike. Years of research and experience have 
left many educators with serious doubts concerning the efficacy of teacher evalua- 
tion efforts. Only by reviewing the literature can one appreciate the burden that 
has fallen to education. 

Prior to and during the 1930 's few educators denied that one could competently 
judge a teacher's proficiency by observing his action in the classroom. Consequently 
rating scales were devised to assist supervisory personnel in measuring teacher 
efficiency. These rating scales typically consisted of items relating to the tea- 
cher's social relations, instructional skills, personal characteristics, and pro- 
fessional qualifications. Researchers, meanwhile, were consistently finding that 
the criteria utilized in the rating scales were exceedingly arbitrary and vague, 
Reavis and Cooper (1945), after reviewing rating devices of 103 school systems, 
concluded: "Ratings appear to be invalid, then as a comprehensive measure of either 
general or specific teacher ability" (5:61). Anderson (1954) confirmed the earlier 
conclusions drawn by Hellfritzch (22) and Lins (26) by testing eight different 
evaluative criteria: ratings by the principal, by peers, by pupils, by the teacher 
herself, by an outside agency, and by pupil achievement scores. He declared in his 
summary: 

No correlation appreciabl y different from zero was discovered between the 
evaluations of the teachers on the different rating scales and the evaluations 
based on the achievements of their pupils in the subject matter areas (2:68). 

Numerous other studies substantiated Anderson's conclusion. Statements similar to 
Rabinowitz and Travers, "No teacher is more effective than another except as some- 
one so decides and designates, . (17:37) abound in the literature. 
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Walker, almost ttjenty years before Anderson, described the core difficulty in 
evaluating teacher effectiveness, and, in so doing, exemplified the i:revalent at- 
titude of that period: 

The lack of an adequate, concrete, objective, universal criterion for teaching 
ability is thus the primary source of trouble for all who would measure teaching. 
One typical method of attack used in rating scales is to compile a list of broad 
general traits supposedly desireable for teachers, with respect to which the 
rater passes judgment on each teacher. . . Even when the scale is made quite 
specific, relating not to general traits but to concrete procedure, the funda- 
mental difficulty remains, that there is no external and generally accepted 
Criterion against v^hich the scale can be validated to establish the significance 
of its items (l6:602) . 

This passage implies that there is some variable for measuring teacher "^fective- 
ness which applies to all teachers regardless of level, situation, or subjt-jt. This 
single criterion has yet to be defined, allovjing Morsh and Wilder (1954) to confi- 
dently report: 

No single, specific, observable teacher act has yet been found whose frequency 
or percent of occurrence is invariably and significantly correlated x^ith student 
achievement (13 : 1423) . 

Gage, in his comprehensive analysis of this 'criterion of effectiveness' para- 
digm, explained that, even if criteria viere found to be related to teacher effective^ 
ness, there would be disagreement as to the ultimacy of the criteria. For example, 
which of these criteria v/ould contain more weight: the teacher's effect on pupil 
achievement of current educational objectives; or the teacher's effect on pupil 
achievement in subsequent schooling? As decades passed, the search for a universal 
criterion gradually ebbed. Gage, in his Handbook of Research on Teaching > eulogized 
such efforts perhaps most fittingly: 

Research by this paradigm (criterion of effectiveness) has been abundant; hun- 
dreds of studies . . . have been made. In the large, these studies have yielded 
disappointing results: correlations that are nonsignificant, inconsistent from 
one study to the next, and usually lacking in psychological and educational 
meaning (15:18). 

This vein in (i^hat we might term) the ^'Accountability Mountain" had been mined out 
and abandoned, i^ith researchers looking elsewhere for more profitable diggings. 

Objectivity continued to be the prospector's pick, and the introduction of merit- 
pay proposals served to intensify the urgency for developing objective standards to 
measure teacher performance. Once again, exhaustive state-wide studies achieved 
only minimal success in definiag behaviors which were reliably associated with pu- 
pil achievement. In his state-by-state analysis, McPhail stated: 

As indicated in the literature, several states (Delaware, Tennessee, Mississip- 
pi » New York, North Carolina, and Utah) have been concerned and involved in at- 
tempts to evaluate teachers by factors in addition to degree and experience, . , 
most have proven unsuccessful in determining facts of and/or procedures that 
have proven to be workable; therefore, in most instances, they have been dis- 
continued (27:356). 

Another approach used to measure teacher effectiveness has been that of compar- 
ing some teacher characteristic or personality trait with an outcome variable. 
Generally, researchers found few correlations between teacher variables (sex, age, 
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socio-economic background, acader.:lc achievement, marital status, intelligence, 
and voice quality) and teacher effectiveness. Ryans cov.^ icted an indepth study 
of the charactciristics of teachers, but failed to arrive at any variable which 
uas correlated significantly with effectiveness in the classroom. He attributed 
the reason to the wide variations in defining "good teacher," As he explained: 

A person's concept of a good teacher seems to depend on (a) his acculturation, 
his past experience, and the value attitudes he has come to accept, (b) tb 
aspects of teaching which may be foremost in his consideration at a given time, 
and (c) characteristics of the pupils taught (38:370-371), 

Still another approach hypothesized that teacher attitudes and values were re- 
lated to teacher effectiveness, and could serve also as adjunct predictors of 
teacher performance, Attitudinal inventories and personality tests, especially 
the Minnesota Teacher Attitude Inven tory (MTAI) and the Minnesota M ult iphasic 
Personality Inve ntory (t#ffl), attained notable prominence, although numerous critics 
(Munro (33), and Hedlund (17)) attacked their validity in predicting successful 
teachers. Both Rabinowitz and Coleman found the MTAI susceptible to ''faking,*' and 
found that the 'subject with some knov/ledge of the viewpoint endorsed by the se- 
lection agency could in most cases reflect this viewpoint in responding to the 
MTAI" (17:520), 

Heil and Washburne (21), however, asserted there were identifiable types of 
teachers, as classified by the Manifold Int erest Schedul e, and that the different 
types (three in their study) have different effects on the children they teach. 
For example, they discovered that pupils made the most progress in math and science 
under the '^Type A" (turbulent, impulsive, and variable) teacher than under the other 
two types. Similarly, Broudy reported that teachers who scored high on the MTAI 
tended to be more effective with students V7ho are less cognitively oriented, and 
suggested that the teacher be paired V7ith the pupils with whom she might be most 
effective (9), 

With the extreme paucity of solid evidence supporting evaluation of teacher 
effectiveness, the American Association of School Administrators, the Department 
of Classroom Teachers of NEA, and the National School Boards Association acknow- 
ledged in a joint publication. 

The notion of the "good teacher" so basic to study of teacher effectiveness 
turns out to be almost as vague and diffuse as the range of human experience 
relative to teaching (17:4), 

Increasingly, researchers concentrated on observing the interaction between 
teacher and pupil in the classroom. As Mitzel (1960) cautioned. 

In considering both teacher behavior and student behavior as process criteria 
it became clear that neither of them should be studied in isolation from the 
other. The interaction between them appears to be the dominant aspect of the 
whole process of learning (31:1484), 

Hosts of interaction-behavior indices appeared as researchers experienced a re- 
newed enthusiasm and optimism which permeated the decade of the sixties. Flanders * 
Interaction Analysis Procedure (14), Mork's Verbal Reaction Behavior Log (32), and 
Ober's Reciprocal Category System (34) are examples of instruments designed to 
record teacher and pupil behaviors, especially verbal behaviors, and to test the 
relationship between this behavior pattern and the attainment of educational 
objectives . 
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Ano^rson and Hunka, reviewing the litei:ture on teacher evaluation, ccnceaed: 

Studies by . . , Anderson, B 'wer and Reed, Theien, and Flankers have shown a 

relationship between certain :»orts of teacher behavior and extent of growth 

by students along certain des^reable intelleccual and attitudinal criteria (1:75). 

Despite the internal validity obtained in many interaction studies, researchers 
began to question hov; applicable and reliable the correlations V70uld be if the 
same teachers were observed in a different situation. As Barr commented: 

Teaching does not take place in a vacuum; it takes place in a tangible situation 
. , , time has seen the emphasis shift from the teacher per se to the teacher 
in relation to the more important aspects of a situation (3:141). 

Wick and Beggs concur by saying, "The question, 'What are the dimensions of an ef- 
fective teacher?* cannot be answered v/ithout asking, 'In what situation?"' (43:194). 
Even after Turner and Fattu (1:76) concluded that the most economic and systematic 
v/ay of conceptualizing the potential inexhaustible concept of teacher v;as profes- 
sional problem solving, Flanders responded: 

The tveight of the evidence from the studies reviewed suggests that problem 
solving performance is a measurable characteristic of teachers and that this 
characteristic changes under teacher preparation and experience but that its 
importance to teacher success is la rgely co ntingent on t h e type of set ting 
within t^hich the teacher does hi s work (author's emphasis) (13:1432). 

Soar 5 critical of the restrictive viei7 taken by past researchers similarly con- 
cluded: 

Once a multidimensional vie^^7 of the teaching- learning process is accepted, it 
seems likely that a teacher may do some things that are not effective; or even 
that a given act may be effective in i7orking toward one goal but not another; 
or that an act may be effective in teaching one child and ineffective with 
another (41:289). 

Such doubts have caused educational researchers like Stufflebeam (42) and Guba 
(18) to propose evaluation strategy designed to foster decision-making on the part 
of the teacher. This approach implies that the teacher is an active member in the 
accountability process and is more apt to change if he participates in the decision 
to change. In short, such procedures call for the teacher to assess a particular 
situation and then determine what he can and should do to increase his effectiveness 
in that situation. Since supervisors assist in an advisory capacity, much of the 
negative, threatening aura surrounding many evaluative procedures and instruments 
is dispelled. 

As a recent development, Gage suggests the use of 'microteaching' in determining 
the effectiveness of teacher performance. He explains the procedure as one of 
breaking down the global, complex variables explored previously, and selecting for 
study a single facet of teaching, such as explaining ability. The result, hope- 
fully, would be the emergence of a set of lat^ful relationships between variables, 
something unattainable by those who attempted to study the whole, unanalyKable 
process of teaching (16). According to 'micro- teaching , ' an instructor teaches 
a group of five individuals focusing on one aspect or facet of the role. Opponents 
to this approach have retorted sharply that a teacher does not teach five students 
in the same fashion as she does thirty. Also, the obvious artificiality of the 
O situation, they say, precludes its adaptability for measuring teacher effectiveness. 

ERIC 
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Current accountability efforts involving performance contracting and voucher 
systems (notably Texarkana and Cherry Creek programs) have sparked additional re- 
search into evaluation, A proponent of accountability, Leon Lessinger, in Every 
Kid a Winner : Acco unt ability in Education^ elucidates the concept of ' educational 
engineering.' In this volume, he presents a detailed description of the Texarkana 
performance contract project, and elaborates on the procedures required in evalua- 
ting a school's effectiveness (25)* 

Contrastingly, Stephen Hencley (23), in a soon to be published v7ork, presents 
a cogent examination of the impediments to accountability. He examines the philo- 
sophical, technological-economic, and political-legal deterrents to accountability. 
Hencley sees the intense concern for accountability as philosophically in conflict 
with the more open, affectively oriented, child-centered schools advocated by 
Silberman, Glasser and many others* In citing contemporary research on testing 
Hencley iterates a concern over the validity of product accountability as nox^ mea- 
sured. Hencley 's and Lessinge-r's articulate treatments of the problem of accounta- 
bility illustrate the wide disparity of viev7s held by those in the educational 
community • 

Conclusions 

Literature relevant to the present study can be divided into three major areas, 
referred to by Mitzel as presage variables, process variables, and product variables. 
Each of these categories has been investigated testing its relation to teacher 
effectiveness. From the review it was concluded that: 

1. Research linking presage variables \7.',th teacher effectiveness (referring to 
teacher characteristics, values, and attitudes) has produced few tangible results. 
Only very general trait designators such as ^'interested'^ and "sympathetic^* have 
been found to correlate significantly with teacher effectiveness. Contradictory 
conclusions and findings abound in the literature. Disagreement over the defini- 
tion of '*good teaching'' has contributed to the confusion surrounding the research, 
and has impeded efforts at identifying teacher trait?? related to effectiveness. 

2. Though a recent development, the study of process variables has produced a 
greater research objectivity due to its emphasis on measuring pupil and teacher 
behavior. Hov;ever, research results have been negligiblre, frequently positing only 
obvious patterns, for example, the positive relationship between teacher responsive- 
ness to pupil ideas aid subsequent pupil achievement. While researchers claim suc- 
cess in studying the relation of process to product variables, its time-consuming 
procedures are generally considered too impractical for implementation in the 
schools . 

3. Product variables, referring to the effects on pupil behavior or achieve- 
ment has received notev;orthy attention by proponents of accountability. Research 
conducted on the effectiveness of performance contracting is as yet incomplete. 
Meanwhile, critics of accountability procedures have intensified their efforts in 
finding inherent weaknesses in test standards used to validate product achievement. 
The current trend in research has shifted from correlating presage and process vari- 

j ables with product variables focusing now on the relation of product variables 

themselves and teacher effectiveness. 
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CHAFTER III 
METHODOLOGY AND RESULTS 



The purpose of this chapter is tv/ofold: fiist, to describe the sample se- 
lected and to clarify the procedures used in obtaining the data; secondly, to 
provide a description of the results derived from the information collected in 
this survey. 

Methodology 

This study intended to gather information and data pertinent to teacher evalu- 
ation programs conducted in first class school districts in Washington State, 

A questionnaire, explaining the intent of the survey and spo^xfying the data 
desired, was initially mailed to all first class school district superintendents 
in November and subsequently followed by a second mailin^^ in April, Of the sixty- 
eight first class districts, forty districts (approxin^itely 60 per cent) responded 
to the questionnaire . The data collected \7ere handcv^bulated and the conclusions 
expressed in this study reflect the interpretations and viev7S of the authors, 

Results 

Concern one ; To v;hat extent have school districts addressed themselves to the 
state mandate by adopting policy guidelines pertinent to teacher asseiisment? 

In this survey, 95 per cent of the districts reported having adopted district 
policy regarding teacher evaluation. The remaining 5 per cent, although adminis- 
tering a teacher evaluation program, stated they had no formal district policy 
governing it. 

Concern t\Jo : To \7hat extent have individual schools within first class dis- 
tricts formulated their own teacher evaluation procedures rather than employ 
a district-wide model? 

Data revealed that 82 per cent of the sample employed a district-v;ide evalua- 
tion model involving traditional methods of evaluation such as the rating checklist 
and specific comment instruments. Contrasced to the above finding, 10 per cent 
encouraged individual schools to develop their own evaluative programs. It was 
found that these schools, when developing teacher evaluation programs, tended to 
devise and irrplement procedures that varied from the typical principal rating 
used in most districts. These schools frequently instituted such evaluative pro- 
cedures as video-taping and pupil assessment of the teacher. Also, data indicated 
a much higher incidence of peer evaluation as an alternative to principal evalu- 
ation in schools having developed their o\m assessment models. 

An inverse relationship V7as found bet\7een a district's encouragement of diverse 
evaluative methods and its pupil enrollment or size. Thus, as a district increased 
in size, the likelihood of individual schools v7ithin that district developing their 
own evaluative procedures decreased* 

Concern three ; Since accountability emphasizes the attainment of educational 
objectives, this question was posec : to what extent have first class districts 
developed procedures for assessing achievement of performance objectives? Re- 
latedly, what proportion of districts utilize these procedures in their teacher 
evaluation programs? 
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The development of procedures for assessing attainment of educational ob- 
jectives was reported by 45 per cent of the districts • However, only 20 per cent 
had applied these procedures to their formal teacher evaluation programs. Another 
25 per cent of the sample cotrimented that their districts were presently engaged 
in the development of such procedures, but admitted the formulation of educational 
objectives was an arduous and complicated task. The remaining districts either 
responded negatively or omitted answering the question* 

Concern four ; Evaluation performs specific functions implying it is goal 

directed or purposive. In Soliciting information, this survey inquired: 

what is the expressed purpose(s) of your district's teacher evaluation program? 

Overwhelmingly, districts agreed that the primary purpose for their teacher 
evaluation program i-^as instructional improvement. Compared to SC per cent who 
selected this purpose, only 7 per cent cited legal compliance as the primary in- 
tent for their policy. 

Several districts reported more than a single or primary purpose for their 
evaluation program. Secondary reasons given concerned legal compliance, personnel 
retention (tenure, promotion, dismissal), professional grox^th, and staff communi- 
cation. 



ERIC 



Several respondents, noting that the primary or "intended" purpose of instruc- 
tional improvement V7as not necessarily the one accomplished, indicated that a pos- 
sible incongruity existed between the evaluative procedures or practices and the 
designated purpose. Conversely, another administrator whose district policy, he 
believed, related most to legal compliance, remarked that evaluators v;ere never- 
theless encouraged to accentuate a positive attitude when in conference v;ith 
individual teachers. A small minority of the sample, attempting to make the pur- 
pose of instructional improvement more clearly apparent, provided teachers with 
learning agents or co-helpers to assist in the development and achievement of per- 
formance objectives. 

Concern five ; Though teacher evaluation often assumes diverse forms and pro- 
cedures, the evaluator is a necessary component. The following results were 
found when districts responded to this question: what personnel are primarily 
responsible in your district for assessing a teacher's performance? 

The person most frequently reported to be assuming the role of teacher evalu- 
ator was the building principal. His involvement either as sole evaluator or as 
an evaluation team member was reported by 90 per cent of the districts. Only 
5 per cent of the districts allowed teacher self -evaluation coupled with princi- 
pal conference to suffice for the annual evaluation. The use of peer evaluators j 
completely excluding principal participation in the formal evaluation, was per- 
mitted in 5 per cent of the sample. Other districts encouraged peer evaluation 
as a supplementary practice for the teacher benefit, but it did not replace or 
substitute for the district's standard procedure • Another means of teacher as- 
sessment, that of pupil evaluation, was conducted in only three districts, and in 
each instance only on a temporary, experimental basis, or in a specific school 
at the teacher's request. 

Concern six ; Evaluators follow specific procedural guidelines when assessing 
the effectiveness or the performance of a teacher. This survey addressed it- 
self to this question: what procedures are utilized by evaluators to gather 
information that will serve as the basis for teacher assessment? 
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Data revealed that SO per cent of the districts required the evaluator to 
observe the teacher in her instructional role. There was a wide disparity in 
the number of observations required, but a sizeable 48 per cent of those using 
observation techniques as a basis for evaluation required but a single class- 
room visitation. Interestingly, more districts specified a minimum of three 
observations than tv;o , One perceives a dual trend for newer policies in this 
respect. Either they were revised to include additional observations, or the 
observations v;ere replaced by individual conferences in accompaniment with a 
self-evaluation model or procedures including performance objectives. All 
but a few districts v/ho used an observational method stipulated that the evalu- 
ator must confer privately with the teacher to discuss the assessment. 

Another 10 per cent of the sample rsported using individual conferences as 
the basic procedure in acquiring information relating to a teacher's performance. 
This procedure v;as especially evident in districts using a self -evaluation in- 
strument, or in districts where teachers v/ere evaluated on their success in a- 
chieving previously established objectives. Conferences apparently are employed 
only when there is more than a single method utilized in obtaining information 
concerning a teacher's performance. 

Concern seven ; In asking for districts* evaluation instruments, this survey 
sought first, to ascertain the nature of the criteria on v;hich a teacher is 
assessed, and secondly, to describe generally the various styles and forms of 
instruments in use. 

Teachers, according to the criteria listed in evaluation instruments, in pro- 
fessional growth booklets, or in guidelines distributed to teachers, were asses- 
sed most frequently on their instructional skills and personal characteristics. 
Approximately 75 per cent of the sample evaluated teachers using these criteria. 
Other criteria that commonly served as a basis for teacher evaluation included, 
in order of frequency: t>rof essional growth, rapport with fellow colleagues and 
staff, relationships with the community and parents, classroom management ability 
(if specified separately from instruction), and performance of general school 
services. Data indicated that pupil achievement was not used by any district as 
a criterion for assessing teacher performance effectiveness. As might be expected 
districts, in revising their evaluation programs to incorporate the assessment 
of performance objectives, have reduced the prior emphasis placed on a teacher's 
personal characteristics. 

Instruments submitted indicated that teachers continued to be rated in ap- 
proximately 55 per cent of the sample. Scales either using numbers (1-5) or 
descriptors (outstanding, excellent, average, weak) were typically employed. 
There was wide variation as to the number of evaluative criteria listed on the 
instrument, and the degree to which they were delineated into sub-items. Thus, 
on numerous instruments, "personal characteristics*' might be subdivided to spec if 1 
cally include emotional stability, health, voice, tact, appearance, etc., each 
of which would be rated. Slightly over one half of the rating instruments pro- 
vided additional space for the evaluator(s) to elabora*:e briefly on each item. 
In approximately 35 per cent of the sample, evaluators, rather than employing 
a rating instrument, furnished information on a specific comment or general nar- 
rative form. 



Conce rn eight ; In conclusion, this survey solicited administrators' comments 
and reactions regarding their districts* current attempts at assessing teacher 
performance . 
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Administrators vjho reaponded with personal opinions generally expressed 
reservations about their current practices of teacher evaluation. They con- 
veyed a number of common concerns such as the need, first, for more expertise 
by the evaluators, and secondly, for sufficient time to conduct a careful and 
meaningful assessment. Several responses dealt with the intrinsic value of 
the evalu n process--or lack thereof --as illustrated by this excerpt: 

. . te is, not V7anted in any building, have nothing but positive written 
evaluations . . Many were concerned about increasing the objectivity of the 
evaluative process by developing procedures for measuring the attainment of per 
formance objectives, thus, placing the greatest emphasis on instruction rather 
than on the personal attributes of the individual teacher. As one administra- 
tor remarked, ''The trend appears to be away from rating scales to a process 
based upon improvement of instruction/' Other comments reflected a concern for 
additional self-evaluation.'' Other comments reflected a concern for additional 
self-evaluation procedures and the need, regardless of a district's j:resent 
program, for continuous reappraisal of the evaluation program itself. In fact, 
as an optimistic note, a review and/or revision of teacher evaluation programs 
was reported by 90 per cent of the sample. 



CllAFTER IV 

SUMMARY AND DISCUSSION 

The objectives of this chapter are: (1) to summarize the problem and proce- 
dures used in the study; (2) to present conclusions; and (3) to provide a c '.s- 
cussion on the relevant concerns regarding teacher evaluation programs. 

Summary 

The purpose of this study was to gather data and information pertinent to 
teacher evaluation programs conducted in First Class School Districts in Wash- 
ington State. Data and information, submitted in response to a mailed question 
naire, dealt with a district's assessment policy, and specifically included 
concerns relating to evaluative personnel, procedural stipulations, and evalua- 
tive instrument or model. 

Approximately 60 per cent of the First Class Districts furnished information 
which served as the basis for the present study on the status of teacher evalu- 
ation programs as currently administered in these districts. 

Conclusions 

Listed below are the major conclusions of this survey, 

1. The majority of first class districts included in the sample reported 
utilizing a district-wide evaluation model in assessing teacher effectiveness. 
Data revealed that those districts which encouraged individual school develop- 
ment of evaluative programs tended to be relatively smaller in size. 

2. Instructional improvement x^as cited by most of the sample as the primary 
purpose for their teacher assessment program. However, several administrators 
expressed concern over the discrepancy that existed between the procedures and 
the Intended purpose. 
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3, Principals continued to serve as the primary evaluftors of teacher ef- 
fectiveness. Districts have reported increased use of peers as evaluators, and 
also in a few isolated instances, the use of pupil-evaluators . 

4, Observation uas the most frequently used method in obtaining information 
about a teacher's effectiveness. The survey indicated, however, the nev7ly de- 
veloped programs, rather than relying solely on observation, included the asses- 
sment of performance objectives and self -evaluation techniques such as video- 
tape, 

5, The criteria most commonly listed in teacher evaluation instruraents is 
that of personal character istic:^ and instructional skills. This result is 
qualified by districts who reported in their revised programs increased emphasic 

on instructional skills and it^erf ormance rather than on personal teacher attributes, 

6, The rating instrument prevailed as the most common assessment form. In 
conjunction with this instrument, the majority of districts required a conference 
with the individual teacher to discusc itar;is deserving commendation or needing 
improvement , 

?• An overwhelming majority of districts reported current teacher evaluation 
programs under review and/or revision. 

Discussion 

On the national level, there is concern over the effectiveness of public 
education. Reflecting this concern, the national assessment, headed by Ralph 
A, Tyler, is attempting to furnish overall information about the educational 
attainment of the general population. As the public and educators grow to ex- 
pect more from education, there is a need to know the status of education, both 
its strengths and weaknesses. This call for measuring educational effect ive- 
ness--ref erred to as accountability--'=^mphasizes the verifiable measurement of 
pupil achievement induced as a result of the instructional program. Although 
its singular intent is to provide information, any assessment of the schools 
contains broad ratifications. 

The incontiovertible trend toward product accountability, which utilizes a- 
chievement tests as standard assessment measures, creates many complications for 
the teacher, currently the primary focus of evaluation procedures. Many teachers 
feel uneasy about this hard data approach. They point out that achievement tests 
are not valid indicators of what they have taught, much of which is ephemeral 
and unretrievable at test time. Teachers, who are unable to control numerous 
variables including the home environment, a child's readiness, and the specified 
curriculum, resent being held accountable for learning when they cannot decide 
what is taught, nor, in many instances, how it is taught. A recent editorial 
in Saturday Review elaborated on this issue of product accountability: 

As we focus increasingly on pupil performance as a measure of teacher ef- 
fectiveness, however, it would be easy to forget the complexity of the 
learning process--that individual children are very different, that they 
learn different things at different rates, and that even the same child 
learns at different rates at different times. If, therefore, the laudable 
( effort to improve classroom practice by assessing teacher and school effec- 

tiveness merely results in placing more intense and sophisticated pressure 
on children to perform, the very principle x^ill be denied in practice, for 
^ if the concept means anything it is that the ultimate accountability must 

El\lC be to the children (10:41). 
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Yet a teacher must receive some feedback on V7hether children learn, otherwise 
she will not know if she is succeeding, nor in v;hat direction to proceed. The 
essential questions 'v7hat to measure' and *hov; to measure* remain. 

Other ramifications of accountability involve the public v;ho assume greater 
responsibility in sv.ecifying explicitly the schools* educational goals. This 
is no simple task, for parents - and educators--hold widely disparate viev7s on 
what are appropriate educational goals. Communities also must decide v/hether 
the resources provided to the schools are commensurate tc the goals they want 
accomplished . 

Thus, many fundamental questions arise as a result of the concern for .ac- 
countability : ''IJho is accountable?'* 'For v;hat are they accountable?'* and 
''Who bears the responsibility for conducting relevant and objective assessment 
programs?'* 

These questions are pertinent to our o\m State of Washington, v;here the 
Legislature has reflected the national trend toward accountability by mandating 
annual evaluation of certificated school personnel. The State lav/, however, 
has not resolved the questions surrounding accountability, and teacher evalu- 
ation, as indicated by this study, is not equivalent to accountability. While 
accountability measures product, teacher evaluation has focused on the presage 
and process variables, such as a teacher's personal characteristics or class- 
room management skills. While accountability aspires toward objectivity, 
teacher evaluation has remained essentially subjective, relying upon rating 
instruments and infrequent observation of the teacher by the evaluator. While 
accountability provides rigorous and continual reporting of school plans and 
performances, teacher evaluation has concentrated upon only a singular aspect 
of education, the role of the teacher. 

Too frequently, accountability has focused on the microcosm of the class- 
room rather than on the macrocosm of the school as a function of the communi- 
ty. In order to hold teachers accountable to the public for their performance, 
cer*:ain prerequisites must be fulfilled. These prior conditions follow i 
systematic sequence and involve a divevsification of the responsibility for 
the total school program. The prerequisites are: 

Communities, possibly through group process techniques, need to establish 
goal statements. Schools have operated too long accepting responsibilities 
for many of society's frailties. Communities must delineate the approp- 
riate goals or functions for its schools. 

These goals need to be translated into performance objectives. Teachers 
will then understand the basis for their assessment, and such objectives 
will add clarity to their specific assignments. 

Federal, state and local government agencies must commit themselves to 
making quality education available to all children, insuring that every 
individual V7ill learn according to his capacity. This commitment obvi- 
ously bears a concomitant responsibility to insure that quality education 
is not denied to pupils because of race, economic station, or financial 
ability of the local community to support quality education. 

Methods must be selected or developed by which the performance objec- 
tives will be achieved. This requires schools to inform the communities 
of resources needed to accomplish the established objectives. If resources 
(teachers, materials, community services, etc) are not available, then 



3. 
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the objectives should be reconsidered. 



5. Measures must be developed in order to accurately appraise attainment 
of objectives. 

6. Procedures need to be established for reporting information to the public 
in order to plan for subsequent instruction. 

This systematic process will require the combined efforts of the schools, communi- 
ties, and various agencies supporting public education. When each recognizes its 
specific responsibilities, teacher assessment x^ill assume its proper role as a 
segment of school accountability intent upon improving instruction • 
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